Max-margin structured output learning in L1 norm space
نویسندگان
چکیده
We study a structured output learning setting where both the sample size and dimensions of the feature vectors of both the input and output are very large (possibly infinite in the latter case), but the input and output feature representations are nonnegative and very sparse (i.e. the number of nonzero components is finite and their proportion to the dimension is close to zero). Such situations are encountered in real-world problems such as statistical machine translation. We show that in this setting structured output learning can be efficiently implemented. The solution relies on maximum margin learning of the linear relations between the inputs and outputs in an L1 norm space. This learning problem can be formulated by imposing L∞ norm regularisation on the linear transformation expressing the relations.
منابع مشابه
Maximum Entropy Discrimination Markov Networks
Standard maximum margin structured prediction methods lack a straightforward probabilistic interpretation of the learning scheme and the prediction rule. Therefore its unique advantages such as dual sparseness and kernel tricks cannot be easily conjoined with the merits of a probabilistic model such as Bayesian regularization, model averaging, and ability to model hidden variables. In this pape...
متن کاملSpectral Regularization for Max-Margin Sequence Tagging
We frame max-margin learning of latent variable structured prediction models as a convex optimization problem, making use of scoring functions computed by input-output observable operator models. This learning problem can be expressed as an optimization problem involving a low-rank Hankel matrix that represents the inputoutput operator model. The direct outcome of our work is a new spectral reg...
متن کاملMax-Margin Structured Output Regression for Spatio-Temporal Action Localization
Structured output learning has been successfully applied to object localization, where the mapping between an image and an object bounding box can be well captured. Its extension to action localization in videos, however, is much more challenging, because we need to predict the locations of the action patterns both spatially and temporally, i.e., identifying a sequence of bounding boxes that tr...
متن کاملMultilabel Structured Output Learning with Random Spanning Trees of Max-Margin Markov Networks
We show that the usual score function for conditional Markov networks can be written as the expectation over the scores of their spanning trees. We also show that a small random sample of these output trees can attain a significant fraction of the margin obtained by the complete graph and we provide conditions under which we can perform tractable inference. The experimental results confirm that...
متن کاملGibbs max-margin topic models with data augmentation
Max-margin learning is a powerful approach to building classifiers and structured output predictors. Recent work on max-margin supervised topic models has successfully integrated it with Bayesian topic models to discover discriminative latent semantic structures and make accurate predictions for unseen testing data. However, the resulting learning problems are usually hard to solve because of t...
متن کامل